# Multimodal Document QA
Glm 4vq
A 4-bit quantized version of GLM-4V-9B that supports multimodal, multilingual understanding with memory usage under 9 GB; its description claims it outperforms several mainstream models
Image-to-Text
Transformers Multilingual

nikravan
440
33
Layoutlm Invoices
A multimodal LayoutLM model fine-tuned for question answering on invoices and other documents, with support for non-contiguous text spans
Image-to-Text English
aslessor
16
2
Layoutlm Invoices
A document QA model based on the LayoutLM architecture, fine-tuned to handle non-contiguous text in invoices and other documents
Text-to-Image
Transformers English

magorshunov
145
57
Layoutlm Invoices
A document QA model based on the LayoutLM architecture, fine-tuned for question answering on invoices and other documents
Text-to-Image
Transformers English

impira
75.42k
198